NSF PAR Search | NSF Public Access Repository

Nearly Minimax Optimal Submodular Maximization with Bandit Feedback

Tajdini, Artin; Jain, Lalit; Jamieson, Kevin (December 2024, Curran Associates, Inc.)
Globerson, A; Mackey, L; Belgrave, D; Fan, D; Paquet, U; Tomczak, J; Zhang, C (Ed.)
Full Text Available
Minimax Optimal Submodular Optimization with Bandit Feedback

Tajdini, Artin; Jain, Lalit; Jamieson, Kevin (December 2024, Neural Information Processing Systems)

Full Text Available
Fair Active Learning in Low-Data Regimes

Camilleri, Romain; Jain, Lalit; Jamieson, Kevin; Morgenstern, Jamie (July 2024, Uncertainty in Artificial Intelligence)

Full Text Available
Optimal Exploration is no harder than Thompson Sampling

Li, Zhaoqi; Jamieson, Kevin; Jain, Lalit (May 2024, International Conference on Artificial Intelligence and Statistics)

Full Text Available
Optimal Exploration is no harder than Thompson Sampling

Li, Zhaoqi; Jamieson, Kevin; Jain, Lalit (May 2024, Proceedings of Machine Learning Research)

Full Text Available
A/B Testing and Best-arm Identification for Linear Bandits with Robustness to Non-stationarity

Xiong, Zhihan; Camilleri, Romain; Fazel, Maryam; Jain, Lalit; Jamieson, Kevin (May 2024, AISTATS)

Full Text Available
A/B Testing and Best-arm Identification for Linear Bandits with Robustness to Non-stationarity

Xiong, Zhihan; Camilleri, Romain; Fazel, Maryam; Jain, Lalit; Jamieson, Kevin (May 2024, International Conference on Artificial Intelligence and Statistics)

Full Text Available
Humor in AI: Massive Scale Crowd-Sourced Preferences and Benchmarks for Cartoon Captioning

https://doi.org/10.52202/079017-3978

Chen, Jiayi; Guo, Yang; Jain, Lalit; Jamieson, Kevin; Mankoff, Robert; Nowak, Robert; Rogers, Timothy; Sievert, Scott; Suresh, Siddharth; Wagenmaker, Andrew; et al. (January 2024, Neural Information Processing Systems Foundation, Inc. (NeurIPS))

Full Text Available
Nearly Optimal Algorithms for Level Set Estimation

Mason, Blake; Jain, Lalit; Mukherjee, Subhojyoti; Camilleri, Romain; Jamieson, Kevin; Nowak, Robert (March 2022, International Conference on Artificial Intelligence and Statistics)

The level set estimation problem seeks to find all points in a domain 𝒳 where the value of an unknown function 𝑓: 𝒳 → ℝ exceeds a threshold 𝛼. The estimation is based on noisy function evaluations that may be acquired at sequentially and adaptively chosen locations in 𝒳. The threshold value 𝛼 can either be explicit and provided a priori, or implicit and defined relative to the optimal function value, i.e. 𝛼 = (1−𝜖)𝑓(𝐱∗) for a given 𝜖 > 0, where 𝑓(𝐱∗) is the maximal function value and is unknown. In this work we provide a new approach to the level set estimation problem by relating it to recent adaptive experimental design methods for linear bandits in the Reproducing Kernel Hilbert Space (RKHS) setting. We assume that 𝑓 can be approximated by a function in the RKHS up to an unknown misspecification and provide novel algorithms for both the implicit and explicit cases in this setting with strong theoretical guarantees. Moreover, in the linear (kernel) setting, we show that our bounds are nearly optimal, namely, our upper bounds match existing lower bounds for threshold linear bandits. To our knowledge this work provides the first instance-dependent, non-asymptotic upper bounds on sample complexity of level-set estimation that match information theoretic lower bounds.
Full Text Available
Instance-optimal PAC Algorithms for Contextual Bandits

Li, Zhaoqi; Ratliff, Lillian; Nassif, Houssam; Jamieson, Kevin; Jain, Lalit (January 2022, Advances in neural information processing systems)
Koyejo, S.; Mohamed, S.; Agarwal, A.; Belgrave, D.; Cho, K.; Oh, A. (Ed.)
In the stochastic contextual bandit setting, regret-minimizing algorithms have been extensively researched, but their instance-minimizing best-arm identification counterparts remain seldom studied. In this work, we focus on the stochastic bandit problem in the (ε, δ)-PAC setting: given a policy class Π the goal of the learner is to return a policy π ∈ Π whose expected reward is within ε of the optimal policy with probability greater than 1 − δ. We characterize the first instance-dependent PAC sample complexity of contextual bandits through a quantity ρ_Π, and provide matching upper and lower bounds in terms of ρ_Π for the agnostic and linear contextual best-arm identification settings. We show that no algorithm can be simultaneously minimax-optimal for regret minimization and instance-dependent PAC for best-arm identification. Our main result is a new instance-optimal and computationally efficient algorithm that relies on a polynomial number of calls to an argmax oracle.
Full Text Available
